Variable selection in general multinomial logit models

نویسندگان

  • Gerhard Tutz
  • Wolfgang Pößnecker
  • Lorenz Uhlmann
چکیده

The use of the multinomial logit model is typically restricted to applications with few predictors, because in high-dimensional settings maximum likelihood estimates tend to deteriorate. In this paper we are proposing a sparsity-inducing penalty that accounts for the special structure of multinomial models. In contrast to existing methods, it penalizes the parameters that are linked to one variable in a grouped way and thus yields variable selection instead of parameter selection. We develop a proximal gradient method that is able to efficiently compute stable estimates. In addition, the penalization is extended to the important case of predictors that vary across response categories. We apply our estimator to the modeling of party choice of voters in Germany including voter-specific variables like age and gender but also party-specific features like stance on nuclear energy and immigration.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Selection of multinomial logit models via association rules analysis

In this research, we propose a novel approach for a multinomial logit model selection procedure: specifically, we apply association rules analysis to identifying potential interactions for multinomial logit modeling. Interaction effects are very common in reality, but conventional multinomial logit model selection methods typically ignore them. This is especially true for higher-order interacti...

متن کامل

Multinomial logit models with implicit variable selection

Multinomial logit models which are most commonly used for the modeling of unordered multi-category responses are typically restricted to the use of few predictors. In the high-dimensional case maximum likelihood estimates frequently do not exist. In this paper we are developing a boosting technique called multinomBoost that performs variable selection and fits the multinomial logit model also w...

متن کامل

Modeling the behavior of disordered taxi drivers of Tehran for choosing passenger and destination

In this study, the manner of private taxis drivers has been investigated for choosing passenger and destination from a fixed point. Therefore, two models called Multinomial and Nested Logit Models have been utilized. The information gained by scrolling in 2016 is the input data, which are in the format of revealed preference, acquired by the verbal interview in Vanak Square in Tehran (Iran). Ba...

متن کامل

Working Paper Series Categorical Data Categorical Data

Categorical outcome (or discrete outcome or qualitative response) regression models are models for a discrete dependent variable recording in which of two or more categories an outcome of interest lies. For binary data (two categories) probit and logit models or semiparametric methods are used. For multinomial data (more than two categories) that are unordered, common models are multinomial and...

متن کامل

Multinomial Logistic Regression Ensembles

This article proposes a method for multiclass classification problems using ensembles of multinomial logistic regression models. A multinomial logit model is used as a base classifier in ensembles from random partitions of predictors. The multinomial logit model can be applied to each mutually exclusive subset of the feature space without variable selection. By combining multiple models the pro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computational Statistics & Data Analysis

دوره 82  شماره 

صفحات  -

تاریخ انتشار 2015